-
Notifications
You must be signed in to change notification settings - Fork 61
feat: add eval job that runs on CI #1167
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…e-agent-sdk into add-eval-job-run-on-CI
|
[Automatic Post]: It has been a while since there was any activity on this PR. @simonrosenberg, are you still working on it? If so, please go ahead, if not then please request review, close it, or request that someone else follow up. |
|
@OpenHands /codereview-roasted |
|
I'm on it! simonrosenberg can track my progress at all-hands.dev |
Code Review Summary: PR #1167 - Eval Workflow IntegrationI've completed a comprehensive "Linus-style" code review of the PR that adds evaluation job triggering to the software-agent-sdk CI pipeline. Here's what I found: How the Workflow WorksThe PR adds a GitHub Actions workflow (
🔴 CRITICAL ISSUES (Must Fix)1. Security Risk: Using
|
|
Looks like there are a few issues preventing this PR from being merged!
If you'd like me to help, just leave a comment, like Feel free to include any additional details that might help me get this PR into a better state. You can manage your notification settings |
Agent Server images for this PR
• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server
Variants & Base Images
eclipse-temurin:17-jdknikolaik/python-nodejs:python3.12-nodejs22golang:1.21-bookwormPull (multi-arch manifest)
# Each variant is a multi-arch manifest supporting both amd64 and arm64 docker pull ghcr.io/openhands/agent-server:898eb1c-pythonRun
All tags pushed for this build
About Multi-Architecture Support
898eb1c-python) is a multi-arch manifest supporting both amd64 and arm64898eb1c-python-amd64) are also available if needed